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THE  EXPECTED  TIME  COMPLEXITY  OP  PARALLEL 


GRAPH  AND  DIGRAPH  ALGORITHMS 


by 

John  Reif*  and  Paul  Spirakis 
Aiken  Computation  Laboratory 
Division  of  Applied  Sciences 
Harvard  University,  Cambridge,  Massachusetts 


SUMMARY 

This  paper  determines  upper  bounds  on  the  expected  time  complexity 
for  a  variety  of  known  parallel  algorithms  for  graph  problems - 

For  connectivity  of  both  undirected  and  directed  graphs,  transitive 
closure  and  all  pairs  minimiam  cost  paths,  we  prove  the  expected  time  is 
O(loglogn)  for  a  parallel  RAM  model  (RP-RAM)  which  allows  random  resolution 
of  write  conflicts,  and  expected  time  0 (log n loglog  n)  for  the  P-RAM  of 

[wyllie,  79],  which  allows  no  write  conflicts. 

We  show  that  the  expected  parallel  time  for  biconnected  components  and 

2  2 
minimum  spanning  trees  is  0{  (loglog  n)  )  for  the  RP-RAM  and  O(log  n  •  (loglog  n)  ) 

for  the  P-RAM. 

Also  we  show  that  the  problem  of  random  graph  isomorphism  has  expected 
parallel  time  O(loglogn)  and  0{logn)  for  the  above  parallel  models, 
respectively. 

Our  results  also  improve  known  upper  bounds  on  the  expected  space 

required  for  sequential  graph  algorithms.  For  example,  we  shown  in  Section  7 

that  the  problems  of  finding  strong  components,  transitive  closure  and  minimum 

0(1) 

cost  paths  have  expected  sequential  space  0 ( log • loglog  n)  with  n  time 

on  a  Turing  Machine  given  random  graphs  as  inputs. 

This  work  was  supported  by  the  National  Science  Foundation  Grant  NSF-MCS79*’ 

21024  and  the  Office  of  Naval  Research  Contract  N00014-80-C-0074 . 
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1.  INTRODUCTION 

1.1  The  Problems 

Considerable  work  has  been  done  on  sequential  graph  algorithms  which  are 
fast  on  the  average  given  random  input  graphs.  This  includes  the  work  of 
[Angluinr  Valiant,  ,  [Karp,  76],  [Karp,  Sipser,  81],  [Schnorr,  78]  ,  [Karp, 
Tarjan,  80],  *[Reif,  Spirakis,  80],  [Spirakis,  81].  Almost  no  previous  work 
examined  the  average  performance  of  parallel  graph  and  digraph  algorithms. 

We  analyze  here  the  average  performance  of  parallel  algorithms  for 
connectivity,  biconnected  components,  strong  connectivity  and  transitive 
closure,  minimum  cost  spanning  tree,  minimum  cost  all  pair  shortest  paths 
and  graph  isomorphism. 

1.2  The  Parallel  Machine  Models 

We  consider  here  some  fundamental  models  of  parallel  computation,  all 
of  which  assume  the  presence  of  an  unlimited  number  of  processors.  In  the 
first  model,  P-RAM  of  [Wyllie,  79],  each  processor  is  capable  of  performing 
arithmetic,  boolean  and  certain  read  and  write  operations.  The  processors 
have  access  to  a  common  main  memory.  In  the  P-RAM,  different  processors  can 
read  the  same  memory  location  at  the  same  time.  They  may  store  infonnation 
at  different  memory  locations  simultaneously,  but  no  two  processors  can 
attempt  to  change  the  contents  of  the  same  memory  cell  at  the  same  time. 

(Reif,  81]  presented  a  probabilistic  P-RAM  model,  PP-RAM,  where  processors 
are  capable  of  doing  independent  probabilistic  choices  on  a  fixed  input  (but, 
again,  simultaneous  writing  at  the  same  location  is  not  allowed) . 

In  the  second  model,  the  SP^^RAM,  simultaneous  access  to  the  same  memory 
location  is  allowed  for  both  read  and  write  operations.  In  the  last  case 
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exactly  one  processor  succeeds  but  we  make  no  assumption  of  which  one 
succeeds  (see  [Shiloah,  Vishkin,  80] ) . 

In  the  third  models  the  EP~-RAMt  again  we  allow  simultaneous  access  to 
the  same  memory  location  for  both  read  and  write  operations.  In  the  last 
case,  if  k  processors  attempt  to  write  at  the  same  time,  then  exactly 
one  succeeds.  The  probability  that  each  particular  one  succeeds  is  1/k, 

(One  could  imagine  that  processors  are  ordered  in  some  sequential  order  (all 
sequences  being  equiprobable)  and  then  each  subsequent  write  overwrites 
the  effect  of  the  previous  one  and  that  this  sequential  execution  occurs  in 
a  very  fast  way,  say  one  step.) 

It  is  clear  that  any  parallel  algorithm  for  the  SP-RAM  requires  the 
same  expected  time  on  the  RP-RAM.  It  is  also  clear  that  any  parallel  algo¬ 
rithm  in  the  RP-RAM  of  time  T  implies  a  parallel  algorithm  on  the  P-RAK 
model,  of  time  at  most  O(T.logm)  where  m  is  the  maximum  number  of 
processors  which  could  possibly  compete  for  the  same  memory  location  (one 
could  simulate  the  nondeterministic  choice  by  a  tree  of  pairwise  selections 
to  find  a  unique  winner) . 

1.3  New  Results  in  the  Theory  of  Random  Graphs 

The  input  to  most  of  the  algorithms  here  is  assumed  to  be  a  random 

graph  of  the  model  G  as  defined  in  [Erdos,  Renyi,  60],  or  a  random 

n,p 

digraph  D  as  defined  in  [Spirakis,  Reif,  81].  For  the  minimum  spanning 
n,p 

tree  and  shortest  paths  algorithms  the  graph  is  assumed  to  be  a  random  graph 

G  with  edge  weights  selected  from  a  continuous  distribution,  independently  of 
n,p 

each  other.  In  the  G  model  the  probability  of  existence  of  an  edge  p 

n,p 

is  ^c/n  where  c  is  a  constant  >1.  The  key  to  the  analysis  of  the 
algorithms  of  this  paper  is  some  new  results  on  random  graphs  (given  in  the 
Appendix) . 
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(1)  The  depth  of  a  random  graph  of  the  model  G  with  p>c/n  (or 

n^p 

G  „  with  N>cn)  is  O(logn)  (and  decreases  as  the  graph  density  becomes 
n  r  N 

larger) ,  with  high  probability. 

(2)  The  number  of  connected  components  of  a  random  graph  (or  digraph 

for  strong  connectivity)  of  the  model  G  with  p^c/n  (where  c>2)  is 

n,p 

O(logn)  with  probability  ^1-n  and  this  number  decreases  as  p  grows. 
(This  holds  also  for  k-blocks#  k>l  for  any  constant  k.) 

(3)  A  random  graph  of  density  p^c/n  where  c^2  has  a  large 

component  (of  the  appropriate  type)  of  size  ^n log  n  with  probability 
.  -2 

^1  -  n  .  A  previous  result  of  [Karp,  Tarjan,  80]  showed  only  that  there 
is  a  large  component  of  size  at  least  en,  e > 0,  with  high  probability. 

1.4  Expected  Time  Results  for  Known  Parallel  Graph  Algorithms 

Tlsing  ^b.e  above  fact-S  ‘^Qroe  non-trivial  average  analy*;-}*;, 

show  that  one  can  find  the  connected  components  of  a  random  graph  G 

n,p 

and  the  strong  components  and  transitive  closure  of  a  random  digraph  D 

n,p 

and  also  the  all-pairs  shortest  paths  of  a  weighted  random  graph  in  average 
parallel  time  O(loglogn)  in  the  KP-RAM  and  0 (log  n* loglog  n)  in  the 

P-RAM.  The  average  time  for  biconnected  components  and  minimum  spanning 

2  2 
tree  is  0( (loglog  n)  )  for  RP-RAM  and  0(log  n* (loglog  n)  )  for  the 

P-RAM.  The  average  time  for  graph  isomorphism  is  0 (loglog  n)  in  RP-RAM 

and  0(log  n)  in  P-RAM.  The  expected  number  of  processors  for  undirected 

2 

connectivity  is  0(*; - r:; - - - -) .  For  directed  connectivity  and 

log  n* (loglog  n) 

3 

transitive  closure  0(n  /log  n)  processors  are  expected.  For  biconnected 

3 

components  we  require  0(n  /log  n)  processors-  For  minimum  spanning  tree 

2  3 

and  all  pairs  shortest  paths,  we  require  0(n  /log  n)  and  0(n  /log  n) 

2 

processors.  For  graph  isomorphism  we  need  0(n  /log  n)  processors. 

Table  1  gives  our  result 
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We  remark  there  is  generally  a  considerable  drop  in  time  complexity 

when  one  concentrates  in  the  average  case  of  the  above  problems  rather  than 

worst  case  input.  For  sake  of  comparison  we  list  here  previous  results  and 

also  provide  a  table  (Table  2  ) •  The  best  known  deterministic  algorithm  for 

2  .  2  2 

graph  connectivity  is  O(log  n)  time  in  P-RAM  with  0(n  /log  n)  processors 

[Hirschberg,  Chandra,  Sarwate,  79  ]  and  O(log  n)  time  in  SP-RAM  with 

0(n+m)  processors  [Shiloah,  Vishkin,  30]*  [  JaJa,  78]  has  an  O(lognlogd) 

3 

time  and  0(n  /log  n)  processors  algorithm  for  the  P-RAM  where  d  is  the 

depth  of  the  graph.  For  the  biconnected  components,  [Savage,  JaJa,Sl  ] 

2  .  3 

give  an  0(log  n)  time  algorithm  in  deterministic  P-RAM  with  0(n  /log  n) 

2 

processors  and  also  a  O(log n  log  d  log  k)  time  algorithm  with  0(mn+n  log  n) 

processors  where  k  is  the  nximber  of  components  and  m =  edges  of  the  graph. 

2 

For  the  minimum  spanning  tree  [.Savage , Jaja,  81  ]  given  an  O(log  n)  time 
2 

algorithm  with  0(n  )  processors-  For  all  the  above  undirected  graph  problems, 

[Reif,  82  ]  gives  an  0(log  n)  probabilistic  algorithm  for  the  PP-RAM.  This 

time  is  slightly  lower  than  the  O (log n  loglog  n)  expected  time  bound  of 

this  paper  for  solving  undirected  graph  problems  on  P-RAM,  but  [Reif,  82  ] 

requires  an  O(n^logn)  number  of  processors  whereas  our  paper  requests  on 

the  average  considerably  less  processors,  on  the  P-RAM  model.  For  transitive 

closure  and  strong  components  the  best  result  ([JaJa,  78])  for  the  P-RAM  is 

3 

O(lognlogd)  time  with  0(n  /log  n)  processors.  For  all  pairs  shortest 

2 

paths  in  digraphs,  [Dekel,  Nassimi,  Sahni,  81]  give  an  0(log  n)  algorithm 
with  O(n^)  processors.  No  parallel  algorithm  for  isomorphism  was  known 
up  to  now.  Note  also  that  the  results  of  [Reif,  82]  are  not  applicable  to 


digraph  problems. 
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2.  EXPECTED  TIME  COMPLEXITY  FOR  PARALLEL  TRANSITIVE  CLOSURE  AND  STRONG 
CONNECTIVITY 

The  following  well  known  algorithm  for  computing  the  transitive  closure 

of  an  nxn  boolean  matrix  A  was  shown  in  [Dekel,  Nassimi,  Sahni,  81]  to 
2  3 

be  of  O(log  n)  worst  case  time  by  using  0(n  )  processors,  in  the  P-RAM 
model : 

Algoritlim  TC(A) 

B^A+  I 
L: 

B^B  •  B 

if  B*  t^B  then  go  to  L 

[JaJa,  78]  did  an  efficient  implementation  of  this  algorithm  showing  that  it 
he  done  in  OHog  n  Ion  d)  time  on  the  P-RAK  mode}  .,  and  by  nsino 
0(nVlog  n)  processors,  where  d  =  depth  of  G.  Note  that  we  can  drop  the 
number  of  processors  to  0(n^^^3^1og  n)  by  applying  Schonage-Strassen 
algorithm  for  matrix  multiplication.  It  is  easy  to  observe  that  the  basic 
set  operation  of  [JaJa,  78]  (i.e.,  the  union  of  0(n)  sets  of  0(n) 

integers  each,  from  1  to  n)  can  be  done  in  0(1)  time  in  the  SP-RAM,  im¬ 
plying  an  0(log  d)  algorithm  for  transitive  closure.  Using  our  results 

about  the  average  depth  of  D  (see  Appendix)  we  immediately  have: 

n,p 

THEOREM  2.1.  The  average  parallel  time  for  transitive  closure  of 
directed  graphs  is  0(loglog  n)  for  the  SP^RAM  (and  thus  also  the  RP-RAM) 
parallel  computers  and  uses  0(log  n-loglog  n)  for  the  P--RAM  model  using 
0(nVlog  n)  processors.  The  probability  that  the  parallel  time  is  more 
than  this  is  less  than  exp(-kn^)  for  k,a>0. 
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[JaJa,  78]  gave  also  an  algorithm  for  findinr  the  strong  components  of  a 

digraph  in  parallel  time  O(lognlogd)  in  the  P-RAM  model,  by  using  OCnVlogn) 

processors.  Our  results  about  the  depth  of  D  then  imply: 

n,p 

THEOREM  2.2.  The  average  parallel  time  for  finding  the  strong  components 
of  a  digraph  in  the  P-RAM  model  is  O(log  n  loglog  n) ^  using  0(nVlog  n) 
processors.  In  the  SP-RAM  model  (and  thus  also  the  RP-RAM)  the  average  time  is 
Odoglog  n)  using  the  same  number  of  processors.  The  probability  that  the 
parallel  time  is  more  than  thiSj  vs  less  than  exp(-kn  )  for  some  k,a>0. 

Note  that  the  same  closure  idea  can  be  applied  to  undirected  graphs  to 
compute  the  connected  components  in  O(lognlogd)  time  in  the  P-RAM  model  and 
0{log n loglog  n)  average  time,  as  [JaJa,  78]  remarks.  However,  the  number  of 
processors  used  is  wasteful  and  we  give  in  the  next  section  an  independent 
derivation  of  our  result  which  optimizes  over  the  number  of  processors. 

3.  EXPECTED  PARALLEL  TIME  FOR  CONNNECTIVITY  AND  BI CONNECTED  COMPONENTS  IN 
UNDIRECTED  GRAPHS 

3.1  Previous  Parallel  Algorithms  for  Undirected  Connectivity 

[Reif,  82]  provided  a  probabilistic  O(log  n)  time  algorithm  for  the  PP- 

3 

RAM  model,  but  requiring  0(n  log  n)  processors.  [JaJa,  78]  and  [Savage, 

JaJa,  81]  presented  algorithms  in  the  P-RAM  model  for  undirected  graph 

connectivity,  with  parallel  time  O(lognlogd),  where  d  is  the  graph  depth, 

3 

and  with  0{n  /log  n)  processors.  For  both  these  algorithms  the  number  of 

processors  is  wasteful  and  [Hirschberg,  Chandra,  Sarwate,  79]  presented  an 

2 

algorithm  which  runs  in  the  P-RAM  model  in  parallel  time  O(log  n)  and  only 
using  n[n/logn]  processors.  [Savage,  JaJa,  81]  remarked  that  this  algorithm 

is  really  of  time  O(log  n*y)  where  y  =  min{log  ^  is  graph  depth. 
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(Note  that  thus  our  result  that  the  expected  d  is  0{log  n)  does  not  imply 

2 

their  algorithm  has  expected  time  below  O(log  n)  .)  [Chin,  Lam,  Chen,  81] 

2 

improved  the  algorithm  of  Hirschberg  et  al,  to  run  in  O(log  n)  time  by  using 
2  2 

only  0(n  /log  n)  processors,  again  in  the  P-RAM  model.  [Shiloah,  Vishkin,  81] 
adopted  the  Hirschberg  et  dl.  algorithm  to  run  on  an  SP-RAM  computer  and 
provided  an  O(log  n)  algorithm  by  using  0(n+m)  processors,  where  m  is 
the  number  of  edges. 

3.2  Graph  Connectivity  in  O(loglog  n)  Average  Parallel  Time  in  the  RP-RAM, 

2 

Using  0(n  /log  n*log  log  n)  Processors 
We  prove  here: 

THEOREM  3.1.  The  average  garallet  time  complexity  for  connected  components 

of  the  random  graph  G  with  p^— ,  c>l,  is  O(loglog  n)  in  the  RP-RAM 

n,p  n 

models.  Thp.  nneraoe  number  of  r>roo.pj^f^nrs  uspd  i.p  o(n+pn)  .  The  prnhnhUity 
that  the  time  complexity  exceeds  Odoglog  n)  is 

We  shall  first  analyze  here  the  average  performance  of  the  [Shiloah, 
Vishkin,  81]  algorithm,  executed  in  our  RP-RAM  model-  The  algorithm  uses  2n+m 
processors,  hence  it  uses  n(2+pn)  processors  on  the  average.  An  improvement 
over  this  will  be  shown  later.  During  the  computation  each  vertex  v  has  a 
pointer  field  D(v)  through  which  it  points  on  another  vertex  or  to  itself. 

One  can  regard  v-^-DCv)  as  a  (directed)  edge  in  an  auxiliary  graph,  called  the 
pointers - graph ,  This  graph  is  a  collection  of  sets,  each  of  which  is  a  rooted 
tree  with  a  self-loop  at  the  roots.  As  the  algorithm  proceeds,  the  number  of 
sets  decreases  while  each  individual  set  increases  (or  disappears) .  This  is 
caused  by  a  "hooking"  operation  in  which  a  tree  is  hooked  on  another  tree. 

The  trees  are  also  subject  to  another  transformation  of  collapsing  towards  the 
root.  At  the  end,  each  remaining  set  is  a  connected  component  of  the  graph 
and  looks  like  a  rooted  star  in  the  pointers  graph. 
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3.3  Informal  Algorithm  Description  (for  details  see  [Shiloah,  Vishkin,  80]) 

The  notation  ”3  means  that  vertex  i  points  on  vertex  j  after 

the  s-th  stage.  Each  stage  has  four  steps,  and  each  step  takes  constant  time 
in  the  RP-RAM  model: 

Step  1:  For  all  vertices  i  i^^g  (collapsing  trees). 

Step  2;  Hook  trees  on  smaller  vertices  of  other  trees:  All  the  vertices 
that  have  pointed  on  a  root  at  the  end  of  the  previous  stage  check  whether 
their  neighbors  are  pointing  on  smaller  vertices.  If  one  finds  such  a  neighbor 
j,  it  tries  to  hook  its  tree  on  this  point  simultaneous  writing  at 

the  same  location  is  used  by  the  RP-RAM  and  one  succeeds  (with  equal  probability) . 
Let  us  call  a  tree  stagnated  in  the  s-th  stage  if  it  has  not  been  changed  in  the 
first  two  steps  of  the  stage.  A  root  of  such  a  tree  is  called  a  stagnated  root. 

Step  3 :  Hooking  stganated  trees:  All  the  vertices  that  point  on  a 
stagnated  root  check  whether  their  neighbors  point  on  a  vertex  of  another  tree. 

If  one  finds  such  a  vertex  j  then  it  tries  to  hook  its  tree  on  Dg(3)*  One 
of  them  eventually  succeeds - 

Step  4;  Second  collapsing:  (i)  (D^ (i) ) . 

We  allocate  processors  to  vertices  and  to  edges.  Each  edge  {i/j}  is  viewed 
as  two  pairs  <irj>r  <j,i>  and  we  allocate  one  processor  to  each  pair.  It  is 
clear  that  just  after  Step  1  or  after  Step  4  the  trees  are  rooted  stars. 

Stagnated  trees  are  also  rooted  stars  at  the  beginning  of  Step  3. 

Let  us  call  each  rooted  tree  a  "supervortex."  It  has  been  remakred  in  the 
literature  that  the  number  of  supervertices  per  connected  component  reduces  by 
at  least  two  in  each  stage,  thus  leading  to  a  total  of  O(log  n)  number  of 
stages  and  an  0(log  n)  parallel  time  algorithm  in  the  SP-RAM  model. 
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3 . 4  Expected  Time  Analysis 

We  assume  here  that  the  input  to  the  algorithm  is  an  instance  G  of  the 

random  graph  G  with  p>c/n  and  n>l.  We  shall  show  that  only 
n,p 

Odoglog  n)  stages  are  needed  on  the  average.  Let  us  assume  that  there  are 

m  supervertices  at  the  end  of  the  t-th  stage,  of  sizes  Sw--wS^,  Since 

1  m 

p^c/n,  by  results  of  the  Appendix  of  this  paper,  the  average  valence  of  each 

node  is  ^2c.  In  fact,  a  large  fraction  with  t  close  to  1)  of  nodes 

of  each  set  i  of  size  will  each  have  valence  ^2  with  probability 

tending  to  1  as  n  tends  to  (since  c  >  1) .  The  edges  out  of  each  node  are 

placed  in  a  random  way.  Let  us  consider  a  particular  set,  of  size  S^.  This  set 

has  a  total  of  2cS^  edges  on  the  average,  originating  out  of  its  vertices 

(it  has,  in  fact,  s?  edges  out  of  its  vertices  with  high  probability). 

The  conditional  probability  that  each  of  these  edges  hits  a  particular  other 
t  t 

set  of  size  is  obviously  Hence,  the  total  probability  that  the  set 

is  hit  is  approximately  (2cS^) (S^)/n,  conditioned  on  the  assumed  sizes 
and  valences.  (In  fact,  it  is  >(2eS^)  (S^/n)  with  high  probability).  Hence, 

X  X 

the  average  number  of  the  sets  to  which  the  set  of  size  has  edges  is 

/ n“S^  \ 

{2cS^)  —  =  2cS*^  ( - i- )>  2c(sN  1)  >  cS^ 

if  we  inductively  assume  that  2 < ^ Vn  for  every  i  and  any  stage  before 
the  end  of  the  algorithm.  So,  we  remark  that,  on  the  average,  each  set  (i.e., 
supervertex)  can  (in  the  next  stage)  potentially  merge  with  at  least  as  many 
supervertices  as  its  size,  multiplied  by  c.  Let  us  now  make  the  inductive 
hypothesis  that  all  (but  a  small  fraction)  of  the  sets  are  of  the  same  average 
size  (say  S^)  at  time  t.  For  each  set  of  size  S  then,  at  least  cS 
processors  propose  for  it  to  merge  with  some  other  set.  A  particular  processor 


-10- 


wins  with  probability  q«l/cS.  Thus,  the  average  number  of  proposals  a  set 

accepts  will  be  at  least  l/2q  =  cS/2.  (To  understand  the  above  argument, 

consider  an  undirected  graph  whose  vertices  have  degree  cS,  Each  vertex 

selects  one  edge  out  of  it  with  probability  1/cS.  Then,  the  average  indegree 

of  the  resulting  directed  graph  will  be  at  least  cS/2.)  Hence,  at  the  next 

t+l-th  stage,  the  size  of  that  set  (and  of  all  but  a  small  fraction  of  the 

cS  c  2 

remaining  sets)  will  be  at  least  S  +  S  •  —  ^  —  S  -  So,  we  have  shown  that,  at 

the  next  stage,  the  sets  will  have  again  equal  average  sizes  (except  for  a 

small  fraction)  and  that  ^  ^  (S^)^.  Let  t^  be  the  number  of  stages 

t^  t  * 

the  algorithm  does  on  the  average.  We  have  S  =  max^{s ,  }  ^  n,  implying 
t^^Odoglog  n)  ,  proving  our  claim  about  the  average  number  of  stages  of  the 
algorithm.  Note  that  the  above  arguments  can  be  used  to  prove  the  giant 
component  theorem  for  random  graphs  (see  [Karp,  Tarjan,  80),  [Reif,  Spirakis ,81] ) 
For  a  more  detailed  analysis,  the  following  statements  can  be  proved  by 
induction  on  the  number  of  iterations. 

(1)  The  size  of  each  of  the  sets  at  the  end  of  the  t-th  stage,  follows 

t  2^ 

approximately  a  Poisson  distribution  of  mean  (c/2)  *2 

2^ 

(2)  The  sizes  of  all  (but  a  small  fraction  equal  to  1-B  r  where  3<  1) 
sets  have  small  differences  with  probability  going  to  1  as 

Clearly,  these  two  statements  again  imply  that  the  mean  value  of  number 
of  stages  is  0(loglog  n)  and  that  the  probability  that  t >  log log  n  is  o(l/n), 


3.5  An  Improvement  Over  the  Number  of  Processors 


(Chin,  Lam,  Len,  81]  use  arrays  to  represent  the  supervertices  and  the 
selection  of  the  minimum  neighbor  in  Step  2  is  done  in  P-RAM  by  using  nk 
processors  in  total,  where  k^n/2. 


1 
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lt  is  shown  in  [Chin^  ham,  Len^  81]  that  the  minimum  of  n  elements  can 
be  computed  on  a  P-RAM  in  time  T  where 

T  *  fn/kl  -  1  +  log  k  for 

For  the  total  time  bound,  we  shall  have  (on  the  average) 

log^  n  /  N  \ 

■"total  =  s  + 

x=0  V  ' 

1  2^ 

where  N.  =  (c/2)  .2 
1 

<  oj—  +  (loglog  n)log  kj 

The  optimal  value  of  the  total  time  is  O(logn  loglog  n)  by  using  an  expected 

2 

number  of  nk = n  /log  n- (loglog  n)  processors.  Note  that  we  could  get  the 
same  average  time  complexity  for  the  P-RAM  model  by  just  simulating  our  algo¬ 
rithm  for  the  RP-RAM  model  but  this  would  require  us  to  use  an  average  of 
n (1+pn)  processors. 

COROLLARY  4.1.  The  average  parallel  time  for  connected  components  of  the 

random  graph  ^  with  p>c/n  is  O(log  n'loglog  n)  in  the  F-RAM  model  and 

2 

the  number  of  processors  needed  is  0{n  /log  n*  loglog  n)  on  the  average. 

3.6  Biconnected  Components 

[JaJa,  78]  provided  an  algorithm  which  uses  the  P-RAM  model  and  finds  the 
biconnected  components  of  a  graph  in  parallel  time  0{log  n*log  d‘log  k)  where 
d  is  the  depth  of  the  graph  and  k  is  the  number  of  biconnected  components. 

3 

The  number  of  processors  needed  is  0(n  /log  n) .  Again,  this  algorithm  can  be 

implemented  on  the  P*P-RAM  model  to  run  in  O(log  d*log  k)  parallel  time  with 
3 

0(n  /log  n)  processors.  Our  results  of  the  Appendix  about  d  and  k  imply  then; 
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COROLLARY  3.2.  The  expected  pai*allel  time  complexity  fox*  finding  biconnected 

2 

components  in  the  random  graph  G  with  p'^c/n,  c>5,  is  0{(loglog  n)  ) 

n  /  p 

2 

for  the  HP-RAM  model  and  O(log  n-  (loglog  n)  )  for  the  P-^RAM^  The  nwnher  of 
processors  needed  is  O(n^/log  n) .  The  probability  that  the  parallel  time  is 
more  than  th^s  is  0(2n  ). 


4.  EXPECTED  PARALLEL  TIl'^  FOR  MINIMUM  WEIGHT  SPANNING  TREE 


4.1  The  Algorithm  of  Sollin 

We  consider  here  the  problem  of  finding  the  minimum  weight  spanning  tree 

of  instances  of  the  random  graph  G  ,  p^c/n,  the  edges  of  which  have  positive 

n,p 

costs  drawn  independently  from  a  continuous  distribution  over  a  positive  domain. 

We  apply  here  Sollin's  algorithm  (see  [Papadimitriou,  77]).  This  algorithm 

3 

was  previously  shown  in  [JaJa,  78)  to  have  worst  case  parallel  time  O(.log  n) 
on  a  P^RAM.  The  algorithm  is  based  on  the  following  observation;  Let  (V,T) 
be  the  minimum  spanning  tree  (MST)  of  the  graph  G-  (V,E)  under  an  edge  cost 
mapping  c;E-^lR^  Then,  for  every  v€v,  e^GT,  where  e^  is  among  the  shortest 
of  the  edges  incident  to  vertex  vGV. 

Sollin* s  algorithm  begins  by  finding  all  minimum-cost  edges  incident  upon 
each  node  of  G.  These  edges  will  constitute  a  forest  if  the  cost  function  is 
one-one.  This  is  so  in  our  case  because  costs  are  drawn  from  a  continuous 
distribution.  We  find  the  connected  components  of  that  forest  and  identify 
the  vertices  of  each  component.  Prom  here  on,  Sollin* s  algorithm  applies  the 
same  process  to  the  graph  whose  vertices  are  the  components.  The  algorithm 
proceeds  until  one  vertex  remains. 


Let  us  consider  the  SP-PAM  model  first.  We  can  find  the  minimum  edge 
incident  to  each  vertex  in  time  O(n/p  +  loglog  p)  by  using  np  processors. 

For  p  =  n/loglog  n  we  get  parallel  time  Odoglog  n)  and  need  n  /loglog  n 
processors  for  this  stage.  For  any  particular  vertex  pair  {u,v},  the  probabi¬ 
lity  that  this  edge  appears  in  the  forst  of  the  minimum  edges  is  P'prob{{u,v} 

has  min  cost  of  all  edges  out  of  u,  given  {u,v}  appeared}  +  p*prob{{u,v} 

has  minimum  cost  of  all  edges  out  of  v,  given  {u,v}  appeared}.  Since  weights 
are  assigned  independently,  one  can  show  that  prob{{u,v}  has  minimum  cost  of 
all  edges  out  of  u}  ^  1/c  for  p>c/n-  Hence,  probability{{u,v}  appears  in 
the  forest}  >  2/n.  So,  the  constructed  forest  has  embedded  into  it  the 
instances  of  a  random  graph  G  ^  with  p' >2/n.  By  the  results  of  Appendix 

of  this  paper,  the  forest  will  have  a  big  component  of  cardinality  >n-log  n 

2 

and  a  small  O(log  n)  number  of  smaller  components,  with  probability  >1  -  1/n  . 

The  construction  of  the  new  graph  (of  the  "nodes"  being  the  components  of 
the  original  with  respect  to  minimum  edges)  will  hence  take  Odoglog  n)  time 
in  the  SP-RAM  (and  also  RP-RAM)  model,  with  high  probability.  To  determine  the 
edges  of  this  reduced  graph  will  take  O (loglog  n)  time  for  each  vertex 
then  0 (loglog  n)  time  at  most  for  all  pairs  of  components.  The  connected 
components  of  the  new  graph  can  be  found  by  a  deterministic  algorithm  in 

2 

O(loglog  n)  time.  This  process  will  continue  giving  us  an  0( (loglog  n)  ) 

—2 

time  algorithm  with  probability  >  1-n  .  Hence: 

THEOREM  4.1.  The  parallel  expected  time  to  find  the  MST  of  a  random  graph 

G  with  random^  independent  weights  in  the  SP^RAM  (and  so  the  RP-^RAM)  models 

n  ,p 

2 

ia  0(  (loglog  n)  ) .  By  airmlatiorit  the  expected  parallel  time  in  the  P-FiAM  is 

2  2 
Odog  n*  (loglog  n)  ).  The  number  of  processors  heeded  is  0(n  /loglog  n) . 

The  probability  that  the  time  is  more  than  stated,  is  0(n  ). 
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5.  EXPECTED  PARALLEL  TIME  FOR  ALL  PAIRS  SHORTEST  PATHS 


Given  a  weighted  n-vertex  graph  G,  the  all  pairs  shortest-path  matrix  A 

is  an  nxn  matrix  such  that  A(i,j)  is  the  length  of  a  shortest  path  from 

k 

i  to  j  in  G.  Let  A  (i,j)  denote  the  length  of  a  shortest  path  from  i 

d 

to  j  going  through  at  most  k-intermediate  vertices.  Clearly  A(i,j)  =  A  (i,j) 
where  d  =  depth  of  G.  Let  A^{i,j)  be  the  length  of  the  edge  {ifj}  if 

{i,j}  is  in  the  graph,  and  + »  else.  It  is  easy  to  see  that  A  (i,j)  = 

k/2  k/2  d  2 

min  {a  (i,m)  +A  (m,j)}.  Hence  we  can  compute  A  by  computing  A  , 

m 

4  2^  d  k  k/2 

A,...,A  ...A.  A  can  be  computed  from  A  '  using  the,  matrix  multiplica¬ 

tion  algorithm  with  +  substituted  for  *  and  min  for  +.  Since  matrix 
multiplication  can  be  done  in  time  O(log  n)  in  P-RAM  using  n^/log  n 
processors  and  in  time  O(loglog  n)  in  SP-RAM  (and  so  RP-RAM)  using  n^/log  n 
processors,  we  get  an  O(log  n  log  d)  deterministic  time  for  P-RAMs  and  an 

O((loglog  n)’log  d)  for  random  graph  G  with  p^c/n  and  c>l  we  get 

n  ,p 

by  our  results  on  the  depth  of  G  :  (See  Theorem  AI  of  the  Appendix) . 

n,p 

COROLLARY  5.1.  The  expected  parallel  time  for  all  pairs  shortest  paths 

.2 

in  the  G  model  with  p^c/n,  c>l  and  arbitrary  weights  is  O((loglog  n)  ) 
n  ,p 

for  the  SP~RAM  or  RP-RAM  and  O(log  n- (log log  n))  for  the  P-RAM  model.  The 

-kn^ 

probability  that  the  time  is  more  than  stated  above  is  o(e  ),  for  k,a>0. 

O 

The  number  of  processors  needed  is  o(n  /log  n) . 

2 

Mote  [Dekel,  Nassimi,  Sahni,  811  gave  an  O(log  n)  deterministic  algorithm 


for  the  P-RAM  model  which  uses  n  processors. 


•  df  .FV 
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EXPECTED  PARALLEL  TIME  FOR  RANDOM  GRAPH  ISOMORPHISM 


The  following  algorithm,  given  by  [Babai,  Kucera,  79]  is  essentially  a 

Breadth-First-Search  procedure  applied  to  random  graphs  of  the  model  G  with 

n,p 

P=l/2: 

[1]  Classify  vertex  by  valences  (i.e.,  each  vertex  gets  its  valence  as  a 

label).  Let  c  *...,c  be  the  classes  of  the  above  classification,  arranged 
i  h 

by  the  order  induced  by  this  classification. 

[2]  (First  refinement) :  Let  N. (v)  be  the  number  of  neighbors  of  v  in 

4 

c^  and  let  N^(v)  = smallest  nonnegative  integer  congruent  to  (v)  mod  4. 

Then,  two  vertices  v,u  are  ordered  now  if  (a)  they  were  of  different  label 

before  or  (b)  they  had  same  Icibels  but 

(N^  (v)  , . . .  ,N^  (v) )  <  (N^  (u)  , . . .  ,N^  (u) ) 

1  h  1  h 

lexicographically. 

[3]  Let  c^rC^# • • . be  the  classes  of  [2].  Apply  step  [2]  to  these 
classes. 

THEOREM  6.1.  iBabai,  Kucera,  79].  Let  U  be  the  set  of  vertices  whose 

class  is  not  a  singleton  after  step  [3].  Then  Prob{|ul  >1}  <exp(-cn) ,  c>0. 

The  valence  classification  can  be  done  in  Odoglog  n)  time  by  SP-RAM  (and  thus 

also  the  RP~RAM)  by  using  a  total  of  n  /loglog  n  processors.  In  P-ZMAf,  it 

2 

occn  be  done  in  time  O(log  n)  by  using  o(n  /log  n)  processors.  The  partial 

orders  created  by  the  refinements  are  best  stored  as  a  functiony  corresponding 

to  them.  Clearly  a  function  computation  is  not  more  than  Odoglog  n)  in 

SP-RAM  (and  thus  also  the  RP-RAM)  and  O(log  n)  in  P-RAM  by  using  0(nVloglog  n) 

2 

and  0(n  /log  n)  processors,  respectively.  So: 

COROLLARY  6.1.  A  canonical  labelling  algorithm  on  p,  p = 1/2,  takes 
average  parallel  time  Odoglog  n)  in  SP-RAM  (and  thus  also  the  RP-RAM)  with 
0(n  /loglog  n)  processors  and  O(log  n)  time  in  P-W/V  with  o(n  /log  n)  pi^ocessoi^s. 
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7.  EXPECTED  SEQUENTIAL  SPACE  COMPLEXITY  FOR  GRAPH  ALGORITHMS 

THEOREM  7.1.  The  upper  bound  on  required  sequential  space  for  transitive 
closure  on  digraphs  is  O(log  n  log  d)  where  d  is  the  depth  of  the  digraph. 

Proof.  The  proof  is  similar  to  the  techniques  used  by  [Savitch,  70]  for 
simulating  nondeterministic  space  by  deterministic  space. 

Input:  digraph  D  =  (V,E),  |v|  =  n  . 

The  following  procedure  tests  if  there  is  a  path  from  u  to  v,  of  length 

>Z>  0. 

Algorithm  CONNECT ( u , v , ^ ) 
began 

if  (u,v)  €  E  then  return  TRUE  else 
begin 

for  all  w  €  V 

CONNECT (u,v,[ Vs])  and  CONNECT (w,v, [ £/2 1 
then  return  TRUE 

end 

It  is  clear  that  we  get  the  transitive  closure  by  repeated  applications  of 
CONNECT.  Each  time  we  just  remember  the  recursion  depth.  Therefore,  the 
required  sequential  space  is  O(lognlogd)  where  d  =  depth  of  d.  □ 

COROLLARY.  The  average  sequential  space  for  transitive  closure  is 
O(log n loglog  n) .  The  probability  that  is  more  than  stated  is  0(exp(-kn^) ) . 

Note  that  strong  components,  minimum  cost  paths  etc.  require  only 

Odognlogd)  sequential  space  in  the  worst  case,  implying  an  average  of 

O(log n loglog  n)  for  random  graphs  G  or  digraphs  D 

n,p  n^p 
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APPENDIX 


RESULTS  OF  THE  THEORY  OF  RANDOM  GRAPHS 


A, 1  The  Distribution  of  the  Depth  of  a  Random  Graph 

Let  the  depth  d(G)  of  a  graph  G=  (V,E)  to  be  the  max  ^,,{d (u,v)  ,2} 

u ,  VC  V 

where  d(u,v)  is  the  length  of  the  shortest  path  between  u  and  v,  if  they 

are  connected  otherwise) .  The  depth  of  a  random  graph  G  is  the  random 

n,p 

variable  d(G  )  whose  values  are  the  depths  of  the  instances  G  of  G 

n,p  n,p 

The  above  definition  generalizes  to  digraphs  in  an  obvious  way.  We  now  give  an 
average  case  argument  about  d  and  then  we  discuss  d*s  distribution. 

LEMMA  1,  The  average  value  of  d(G  )  is  O(log  n)  for  G  with 
^  n,p  n,p 

p  ^  c/n ,  and  c  >  1 . 


Proof.  Let  G=  (V/E)  be  an  instance  of  G  and  let  u  be  a  particular 
- -  n,p 

node  of  G.  Assume  we  do  Breadth-First-Search  out  of  u  in  parallel,  so  that 
nodes  at  the  same  distance  from  u  are  reached  at  the  same  time  t.  Let 
be  #  nodes  at  depth  t,  i.e.  reached  at  time  t.  Let  be  the  average  size 

at  time  t  of  the  set  of  visited  nodes.  Clearly 


t+1 


=  +  B  , 

t  t+1 


and 


B 


t+1 


pB^(n-  S^). 


From  these  two  equations  one  can  prove  (by  induction)  that 

1^ 

and  Sj^  =  n(l  +  c+*.*+c  )  for  p^c/n,  c>l.  Clearly,  S 

average  depth  of  G  .  Hence  d  =  0(log  n) . 
n,p 


d 


^  n 


for  d  =  the 


LEMMA  2.  The  probabtHty  that  d(G  )  >  log  n  ^e  bounded  above  by 

-kn“  n,p 

0(e  )  for  k,a  positive  constants  depending  on  for  graphs  G  irit} 

Xifp 

p^c/n,  c>l.  This  implies  that  d  =  o{log  n) . 

Proof.  Consider  two  specific  vertices  u,v  of  V.  The  probability  of 

no  path  of  length  1  between  u^v  is  =  1-p.  The  probability  of  no  path  of 

2  n^  2 

length  2  is  “  P  )  since,  for  a  path  of  length  2  to  exist,  at  most 

n-2  possible  nodes  in  V --  {u,v}  can  participate,  each  such  path  has  probabi- 
2 

lity  p  to  appear  and  these  paths  have  no  edges  in  common. 

The  probability  of  no  path  of  length  j  is  (for  j  > 1) 

(1-p^) (n-3) , . . (n-j) 

since  each  such  path  has  prob  =  p^  to  appear  and  there  are  n-2  independent 
choices  for  the  first  vertex  after  u,  n-3  choices  for  the  second  vertex,  etc. 
The  probability  that  there  is  no  path  of  length  log  n  for  the  specific  pair 
of  vertices  u,v  is 

As  A  A  A  /*!  WT  2-n-2  log  n.  (n-2)  . , .  (n-log  n) 

^  ‘*’2  ^ogn  =  (1-P){1-P)  -.-(l-P  ) 


For  p^c/n  and  p  =  o(l)  we  have  l/p-^®®  as  n*^®®  and  hence 


k, (n-2)... (n-k)  ^  -p^ (n-2) . . . (n-k) 

(1-p  )  e 


For  p  =  o(l)  we  also  have  1-p-^l  as  n-**®®.  So, 


$  e 

u,v 


*“Pl  (n-2)p  +  . . .  +  (n-2) . . .  (n-log  n)p^^^ 

r.  ,  %  /  ,  xlog  n  log  n, 

<  (n-log  n)p  +  . . .  +  (n-log  n)  p  J 


[(n-log  n)p]^°^  "-1  _ 
(n-log  n)p-l  * 


A3 


-11 

This  expression  is  bounded  above  by  e  '  for  c>l#  But 


2  - 1] 

Prob{d(G)  > log  n}^n$  <ne  ^ 

u  f  V 


If  c>l  then  3k, a  depending  on  c  (k>0,a>0)  such  that,  for  large  n 


2  -n 
n  e 


This  proves  the  Lenuna. 

Note  that,  for  pn">*°®  as  n*^®°,  one  can  prove  that  the  d(G)  ^ 


log  n 
log (pn) 


with  probability  tending  to  1.  For  p  ^  c  ^ ,  there  is  almost  always  a 

n 

path  of  length  ^  3  between  any  two  nodes  of  G  .  (The  reason  is  that, 

n,p 

for  any  pair  of  vertices  u,v  of  V,  if  S  ,  (S  )  are  the  sets  of  vertices 

u  v 

which  are  neighbors  of  u  (of  v)  but  not  of  v  (not  of  u)  then  [s  |  = 

— 6n 

|s^|^e*n  for  some  e>0  with  probability  ^1  -  e  ,  3  >  0,  Then,  a  lemma 

of  (Erdos,  Renyi,  59]  (also  (Karp,  Tarjan,  80])  proves  that  at  least  one  edge 

^  -3  *  n 

connects  a  node  of  S  and  a  node  of  S  with  probability  ^  1  -  e 

u  V  ^ 

o< 3*  < e.) 

We  will  quote  here  some  results  about  the  valence  of  nodes  of  G  ,  stated 

n,p 

in  (Erdos,  Renyi,  59] . 

REMARK  1.  If  p = c/n  then  the  degree  of  any  given  vertex  of  an  instance 

of  G  has  mean  value  2c  and  the  niamber  of  vertices  having  degree  r  is 
n,p 

approximately 

n(2c)^ 


REMARK  2.  If  p(n)  ^  w(n)  where  w(n)  as  n->-“  then  the 

n 

probability  that  the  degree  of  a  vertex  will  be  outside  the  interval 


C  w  (n ) 

2  log  n(l±e)w(n)  is  approximately  0{l/n  )•  Hence  the  probability 

that  the  degrees  of  not  all  n  points  will  be  between  2w(n) (1 ± e)log  n,  is 
tending  to  0. 

Note  that,  by  an  elementary  application  of  Whitney's  theorem,  the 

connectivity  k  and  depth  d  of  a  graph  satisfy  k(d-l)<n,  if  the  graph  is 

connected.  This  implies  that  with  high  (conditional  on  the  fact  that  G 

n  ,p 

is  connected)  probability,  the  graph  G  will  be  highly-connected  for 

n,p 

large  values  of  p. 

Note  that  all  of  the  above  results  can  be  easily  generalized  for  random 
digraphs  ^ ,  p  >  c/n ,  c  >  2 . 

A. 2  The  Size  and  Number  of  Connected  Components  for  p  ^  c/n 

THEOREM  A.  2.  For  any  m  =  o(n)  there  is  a  constant  >  i  ^  fimction 

t(n)  >  c^  log  n/m  such  tkat^  if  p>t(n)/n  then  if  X  is  the  cardinality  of 
the  largest  component  of  p  then 

Prob{x  <  n  -  m}  ^  —  ->  0  as  n  ^  . 

t(n)-m 
e 

Proof.  Assume  that  in  the  instance  G  of  G  the  cardinality  X  of 
-  n,p 

the  largest  component  satisfies  the  inequality  X^n-m.  Then,  we  can  find 
two  sets  Y,  Z  such  that  |Y|=m,  [zj^n-m  and  no  edge  between  them. 

This  event  is  above  bounded  by  the  probability  1-q  where 

q  s=  Prob{for  every  pair  of  disjoint  sets  Y,Z  of  vertices  of  sizes 
m,  n-m,  there  is  at  least  one  edge  between  Y,z}. 

Let  us  enumerate  all  possible  pairs  of  sets  of  vertices  of  the  above  sizes. 


Call  them 


where 


A5 


(Y,#Z  ,Z  ) 

1  1  2  2  9  9 


'  (:)(-)  ■  W 


We  have  that 

q  =  Prob{E(Yj^,Zj^)  ^  4)A  ...  AE(Yg,Zg)  (|)} 
where  E{Y,Z)  = set  of  edges  between  Y,Z.  So,  by  Baye's  formula, 


j  E(Y2,Z2 )?'<{») 


Prob{E(Y^,Z^)^(l)}.Prob  j.(y-7z,)?^?  •••  - A - 


(  E(Yj^,Zg.)^<|. 

E(Y^,Z.  )?<<() 


We  need  the  following  enumeration  lemma: 

LEMMA  3.  Fov  every  two  sets  Y^,Z^  having  at  least  one  edge  e  between 
them^  there  are  at  least 

pairs  of  sets  of  sizes  m,  n-m  which  also  have  this  edge  between  them. 

This  lemma  can  be  easily  shown  by  taking  out  the  two  vertices  of  e  and 
enumerating. 

As  a  corollary,  we  conclude  that  there  is  a  suitable  enumeration  of  the 
sets  in  the  q  product  such  that  for  every  term  i  not  equal  to  1  the  next 
g^  or  more  terms  (conditioned  on  the  existence  of  an  edge  from  A^  to  B^) 
will  be  equal  to  1.  Hence,  the  value  of  q  is 

g/g. 


q  >  [Prob{E(Yj^,Z^)  ^  <|)} 


But 


proof  of  that. 


